LONG ARITHMETIC PROGRESSIONS IN SETS WITH SMALL 

SUMSET 



ITZIAR BARDAJI AND DAVID J. GRYNKIEWICZ* 



Abstract. Let A, B (- Z he finite, nonempty subsets witli minA — min_B = 0, and let 

S(A B)-^ ^ if ACB, 
0(A,U) - <^ Otherwise. 

If maxB < max A < + |B| - 3 and 

(1) \A + B\<\A\+2\B\-3-S{A,B), 

then we show A + B contains an arithmetic progression with difference 1 and length 
+ -1. 

As a corollary, if ^ holds, max(_B) < max(yl) and either gcd(yl) = 1 or else gcd{A + 
B) = 1 and |yl + _B| < 2\A\ + \B\ — 3, then A + B contains an arithmetic progression with 
difference 1 and length \A\ + \B\ — 1. 



1. Introduction 

For a subset A C Z, we let diam A = max A — min^ denote its diameter and \A\ its 
cardinality. We let gcd*(A) = gcd{A — oq), where aQ £ A and gcd denotes the greatest 
common divisor. For A, B CI Z, their sumset is the set of all sums of one element from A 
and another from B: 

A + B = {a + b : a e A, b £ B}. 

Also, define 

6(A B) — i ^ X + A O B, for some x € Z, 
^ ' ' ~ \ otherwise. 

The study of the structure of subsets with small sumset has a rich tradition (see |9j and 

|12j for two texts on the subject). One classical result is the {3k — 4)-Theorem of Preiman 

[3] [2] [9] [12] , which states that if a set A of integers satisfies gcd* (A) = 1 and 

(2) \A + A\ < 3\A\ - 4, 

then the diameter of A is at most 2\A\ — 4. In other words, A is an interval with at most 
1^1 — 3 holes. Various generalizations to distinct summands were later found [Ij [7] [H] [6]. 
The latest result is from [B], and shows that if diam (A) > diam (B), gcd* (A) = 1 and 

(3) \A + B\ = \A\ + \B\-l + r < \A\ + 2\B\ - 3 - 6{A, B), 
then diam {A) < \A\ + r - I and diam (B) < min{|^|, \B\} + r - 1. 
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However, as has later become apparent, knowing that there are only a small number 
of holes is not always sufficient. In part, this is because there are many subsets of small 
diameter that nonetheless have large sumset. Working through examples, one quickly finds 
that, informally speaking, it is much more difficult for the holes in a subset A with small 
sumset (and correspondingly the holes in ^ + ^ as well) to occur in the interior of the set 
than near the boundary (namely, near the maximum or minimum element). However, there 
have been few results satisfyingly embodying this idea. 

One such result occurred as a lemma in a pair of papers of J. Deshouillers and V. Lev 
characterizing large sum- free set over TLj-pL [8j [1]. It's main consequence stated that if 
diam (A) < \\A\ — 1, then A — A contains an interval of length 2\A\ — 1 [U Lemma 3]. 
(Some other similar results were also given.) Those familiar with Preiman's Theorem (see 
[S] or [12]) may also recall that the existence of large multi-dimensional progressions plays 
an important role in its proof. Very recently, G. Preiman gave a very precise estimate for 
the length of an arithmetic progression that can be found in ^ -|- A when the sumset is so 
small as to satisfy ([2]), showing that there is always one of length at least 2\A\ — 1. The 
example 

A = {0,1,2,... ,k - r - l,k - r + l,k - r + 3,... ,k - r -1 + 2r}, 

for r = 0, 1, . . . , A; — 3, shows the bound on the arithmetic progression length to be best 
possible, while the example 



A=ih2 



U<^a; + l,x + 2, + 



for X > k + 1, shows that the assumption 1^4 + A| < 3\A\ — 4 from ([2]) is needed. The 
paper of Freiman also delved into the issue of where the holes could occur in A, but the 
other structural information is derivable from the bound on the length of the arithmetic 
progression. 

The goal of our paper is to extend this result of Freiman to pairs of distinct summands 
A and B. 

Theorem 1.1. Let A, B C be nonempty subsets with diami? < diam A < \A\ + — 3 

and 

(4) \A + B\ <\A\+2\B\-3- d{A,B). 

Then A + B contains an arithmetic progression with difference 1 and length \A\ + \B\ — 1. 

Using the previously mentioned structural result for small sumsets from [6], we obtain 
the following immediate corollary. Note gcd*(^) > 1 and gcd*(yl + B) = 1 trivially implies 
\A + B\> 2\A\ + \B\ - 2; see Theorem 1.2]. 

Corollary 1.2. Let A, B C Z be nonempty subsets with d'\am B < diam A and gcd*(^ + 
B) = 1. If 

(5) \A + B\<\A\ + 2\B\-3-6{A,B) 
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and either gcd*(j4) = 1 or 

(6) \A + B\ < 2\A\ + \B\ - 3, 

then A + B contains an arithmetic progression with difference 1 and length \A\ + \B\ — 1. 

We remark that the condition gcd*{A + B) = I is simply a normahzation hypothesis; 
if instead gcd*{A + B) = d and ([5]) and ^ hold, then the difference of the arithmetic 
progression becomes d. 

During the course of the proof of Theorem ll.H the structural consequences concerning the 
location of holes and such will become apparent in the series of propositions and definitions 
leading up to the proof of Theorem ll.il The paper concludes with a few additional remarks. 

2. Long Arithmetic Progressions 

Throughout this section, we assume ^, i? C Z are finite, nonempty subsets normalized 
so that 

(7) minj4 = mini? = 0, 
and with 

(8) M = maxA and iV = maxi?, 

(9) M > N, 

(10) l^ + ^l = \A\ + \B\-l + r, 

so that A is assumed to be the set with larger (or equal) diameter. As the problem is 
translation invariant, there is no loss of generality when assuming d?]). Note, in view of ([7]) 
and Q, that 

(11) 5{A, B) = l\i and only if ^ C B. 

For a, 6 G Z, we define [a, 6] := {x E Z | a < x < 6} C Z. For a set X and an interval 
[a,h] C Z, the number of holes of X in [a, b] is denoted by 

When [a, 6] is the default interval [min A", max X] , we skip reference to the interval, that is, 

= ^X,[minX,maxX]) 

and when we refer to a hole in X without reference to an interval, we simply mean an 
element x € [min A, max A] \ A. 

Observe, in view of ([7]), ([H]) and ([TU]), that 

(12) M = \A\ + hA-l, 

(13) N = \B\ + hB-l, 

(14) hA+B = M + N + l-\A + B\ = hA + hB-r. 
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Also remark that, using (jl2p . we can rewrite the condition diam A < 1^41 + — 3 in Theorem 
Oas /lA < 1^1 - 2 and the condition ^ as r < \B\ - 2 - 6{A, B). 

Proposition 2.1. If Ha < |^| — 1, then 

(15) [N,M]^A + B. 

Proof. Let x G [N,M]. Thus 

(x,0), (:e-1,1), ...,{x-N,N) 

are all representations (a, 6) oi x = a + h with a € [0, M] and 6 G [0, A^]. If x ^ ^ + B, then 
each of these + 1 pairs must either have the first element missing from A or the second 
element missing from B, whence Ha + hs > N + 1 = \B\ + hs (in view of ()13p ). But this 
contradicts < \B\ — 1. □ 

In view of Proposition 12. 11 we see that all holes in A + B lie in one of the disjoint intervals 
[0, A — 1] or [M + 1, M + A]. We refer to them as left and right holes, respectively. 

Since 0+B C A+B and 0+A C A+B, if x G [0, A-1] is a hole in A+B, then x must also 
be a hole in B and in A. Likewise, since M + B C A + B, if x + M G [M + 1,M + A] is a hole 
in then X must also be a hole in and since A+ A C A+B,iix+N G [M+1,M+A] 

is a hole in A + B, then x must also be a hole in A. 

In view of these observations, we make some definitions. 

• Holes x € [0,N — 1]\B for which x ^ A + B remains a hole in A + B are called left 
stable holes in B. 

• Holes X G [0, A — 1] \ ^ for which x ^ A + B are called left stable holes in A. 

• Holes X G [1, A] \ S for which x + M ^ A + B are called right stable holes in B. 

• Holes X G [M — N + 1, M] \ A for which x + N ^ A + B are called right stable holes 
in A. 

A stable hole in A is one which is either right or left stable, and likewise for B. All other 
holes (in either A or B) are called unstable. We let h\ and /i^ denote the respective number 
of stable holes in A and B, and we let and h^ denote the respective number of unstable 
holes in A and B. 

This classification of holes into ones which contribute to a hole present in A + B (the 
stable ones) and those which do not contribute to any hole m A + B (the unstable ones) 
will prove to be a very useful perspective. Note that a pair of stable holes xa and xb, one 
from A and one from B, can be associated to each hole x ^ A + B: Indeed, if x G [0, A — 1] 
is a left hole in A + B, then x must come from a left stable hole both in B and A, i.e., 

Xb = X ^ B and xa = x ^ A 

are both left stable holes. On the other hand, if x G [M + 1, M + A] is a right hole in A + B, 
then it must come from a right stable hole both in B and A, i.e., 

Xb = X — M ^ B and xa = x — N ^ A 
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are both right stable holes. 

We will later see that these mappings are invertible, i.e., that xa = UA for holes x, y ^ 
A + B implies x = y, and likewise xb = ys implies x = y. However, next we prove a very 
important proposition — the key observation used in the proof — which shows that if we have 
a left hole x ^ A + B, then there must be many holes in A n [0, x] and B n [0, x], with an 
analogous statement holding for right holes. 

Proposition 2.2. If x e [0,N] \{A + B), then 

(16) ^A,[o,x] + hB,[o,x] > a; + 1. 
If x + M & [M,M + N]\{A + B), then 

(17) hA\x+M-N,M] + ^B\x,N] > N - X + I. 

Proof. The proof is analogous to that of the previous proposition. If x E [0, A^], then 

(x,0), (x-1,1), ...,(0,x) 

are all representations (a, &) oi x = a + b with a E [0, M] and h G [0, A^] (in view of ([9])). 
If x ^ A + B, then each of these x + 1 pairs must either have the first element missing 
from A or the second element missing from whence ()16p follows. The argument for when 
X + M € [M, M + N\ is analogous, considering instead 

(M,x), (M-l,x + l), ...,(x + M-7V,A^). 

□ 

Next, we show that no hole in B can be both left and right stable. 

Proposition 2.3. Letx G [l,N]\B. IfhA < \B\-2, then either x G A+B orx+M G A+B. 

Proof. If both X ^ A + B and x + M ^ A + B, then applying both cases of Proposition 12.21 
yields 

+ 2 = (x + 1) + (TV - X + 1) < /iA,[0,x] + hB,[0,x] + hA,[x+M-N,M] + f^B,[x,N] 

< hA + hB + 2, 

where the second inequality follows by Now applying (fT3]) yields hA > — 1, contrary 
to assumption. □ 

The following shows there are also no holes in A which are both left and right stable. 

Proposition 2.4. Letx G [0,M]\A. IfhA < \B\-2, then either x G A+B orx+N G A+B. 
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Proof. If both X ^ A + B and x + N ^ A + B, then Proposition ETT] imphes x £ [M - N + 
1,N — 1], whence applying both cases of Proposition 12.21 yields 

M + 2 = X + l + {N-{x-M + N) + l) < /lA,[0,x] + hB,[0,x] + hA,[x,M] + hB,[x-{M~N),N] 

< hA + l + hB + M-N + 1, 

where the second inequality follows in view of Q- Now applying p3|) yields Ha > \B\ — 1, 
contrary to assumption. □ 

In view of Propositions 12.11 12.31 and 12.41 (and the relevant stability definitions) , we see 
that, when Ha < — 2, 

(18) h\ = h% = hA+B = hA + hs-r 

(19) hi = r-hs 

(20) /i^ = r-hA. 

where (fT8]l uses (fn|) . and where (fT9j) and (f20]) follow from (fTSj) by using the identities 
hB = h"^ + h'^B hA = + h^; moreover, xa = UA or xb = Vb implies x = y for holes 
X, y € [0, M + N]\{A + B), as previously alluded. Note xb = xa when x is a left hole, and 
that XA = Xb + (M — N) when x is a right hole. 

The next proposition is the trickiest part of the proof, showing that all left stable holes 
precede all right stable holes, so there is no overlap. 

Proposition 2.5. Suppose Ha < \B\ - 2 and r < \B\ - 2 - 5{A,B). If xb G [^,N] \ B 
is a left stable hole and ub G [0,^] \B is a right stable hole, then xb < Vb- Likewise, if 
XA £ [0, M] \A is a left stable hole and ua ^ [0, M] \A is a right stable hole, then xa < UA- 

Proof. If XA G [0, M] \ ^ is a left stable hole, i/a G [0, M] \ A is a right stable hole and 
XA > UA, then = x^ G [0, A^] \B is a left stable hole and vb = yA-{M -N) £ [{),N]\B 
is a right stable hole with xb > y_B, in view of xa > VA and Q. Therefore we see that 
it suffices to prove the first assertion in the proposition, as the second is an immediate 
consequence. 

To that end, assume xb G [0, A^] \ -B is a left stable hole and ub G [0, A^] \ i? is a right 
stable hole with xb > Ub- Note that xb = Vb cannot hold in view of Proposition 12.31 
Moreover, assume xb and i/b are chosen minimally, meaning that there are no stable holes 
z£ [yB + l,XB-l]\B. 

Applying both cases of Proposition 12.21 to xb and i/b + M, respectively, we find that 



B\+hB + {XB - VB + 1) 



= {XB + 1) + {N -VB + l) 

^ f^A,[0,XB] + ^B,[0,xb] + f^A,[yB+M-N,M] + ^B,[i/s,Af] 
< hA + hB + hA\yB+M-N,XB] + hB,[yB,XB] ' 



(21) 



where we use (jl3p for the first equality. 
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In view of the minimality of xb and yB, see that 

(22) hB,[y^,,^]<hl + 2, 

with equahty possible only if [ub + 1, xb — I] contains all the unstable holes in B. We also 
have the trivial inequality 

(23) hB,[yB,XB] < - VB + 1- 

If Hb + M — N > xb, so that /iA,[j/B+A/-Af,a;i3] = 0) then ([2T]) and ((23]) imply Ha > \B\, 
contrary to hypothesis. Therefore we may assume ub + M — N < xb, and now we also have 
the trivial inequality 

(24) hA,[ys+M-N,XB] <XB-yB + l-{M-N), 

with equality possible only if [i/b + M — N,xb] are all holes in A. 

Applying the estimates ([M]) and ([22]) in ^ and using ([HD and ([20]), we discover 
that 

(25) \A\-2-r <hB-hA. 
In view of ([91), ([l2]) and ([l3|), we have 

(26) hB-hA<\A\-\B\, 

with equality only possible when M = N. Combining (|26p and (|25|) yields 

(27) r>\B\-2, 

whence our hypothesis r < \B\ — 2 — 6{A,B) implies that r = \B\ — 2, that 5{A,B) = 0, 
and that equality held in all estimates used to derive (j27p . 

As a result, 6{A, B) = and ([II]) imply A ^ B; equality in ([26]) implies M = N; and 
equality in ([25]) implies equality holds in both ([2^ and ([22]) . whence [ub + M — N,xb] are 
all holes in A and [y^ + ^,xb — 1] contains all the unstable holes in B. 

Since ^ ^ i?, it follows that there exists z £ A with z ^ B. Since [i/b + M — N,xb] are 
all holes in A and M = A^, it follows that z ^ [i/b, xb]- Thus, since [ijb + 1, — 1] contains 
all the unstable holes in B, it follows that z ^ B is a stable hole in B. However, this means 
that either z + ^ A + B or z + M ^ A + B, which are both contradictions in view of z A 
and M = N, completing the proof. □ 

We are now ready to finish the proof of Theorem II. H which we will follow from the next 
proposition. 

Proposition 2.6. Suppose Ha < |-B| - 2 and r < [B[ - 2 - 6{A, B). Then 

J := [e + 1, M + c - 1] C ^ + 5, 
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where e is the greatest left stable hole in B (let e = —1 if there are no left stable holes) and 
c is the smallest right stable hole in B (let c = N + 1 if there are no right stable holes). 
Moreover, 

(28) \J\=M-l + {c-e) > \A\ + \B\-l + hA,[e+l,c+M-N-l]+hB,[e+l,c-l] 

> \A\ + \B\-1. 

Proof. In view of proposition l2.5l we have e < c. Consequently, by the definition of stabihty, 
and in view of Proposition I2.H we see that 

J := [e + 1, M + c - 1] O A + B. 

Note 

(29) \J\ = M - 1 + {c - e) = \A\ + hA - 2 + {c - e), 

using (I12p . It remains to estimate c — e. 

Applying both cases of Proposition 12.21 to e and c + M, respectively, we find that 

(30) e + l + N-c+l<hA + hB-s, 

where s = ^A,[e+i.c+Af-Af-i] + ^B,[e+i,c-i]- Prom (fT3]) and (fSOl) . it follows that 

\B\ + hB + 1 + e - c < hA + hs - s, 

yielding 

c-e>\B\ + l-hA + s. 
Combining the above estimate for c — e with (j29|) , we obtain 

\J\ = \A\ + hA-2 + {c-e) > \A\+hA-2 + {\B\ + l-hA + s) 

= \A\ + \B\-l + s>\A\ + \B\-l, 

completing the proof. □ 



Finally, we complete the proof of Theorem II. 1[ 

Proof of Theorem \l.l\ We may w.l.o.g. assume minA = mini? = 0. Since diami? < 
diam^ < |^| + |5|-3, we have /i^ < |B| -2 in view of Since \A+B\ := |^| + |5|-l+r < 
1^1 + 2\B\ - 3 - 6{A,B), we have r < \B\ - 2 - 5{A,B). Thus applying Proposition Ej] 
completes the proof. □ 
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3. Concluding Remarks 

We conclude with some brief remarks, for which we assume the notation of the previous 
section, particularly concerning Proposition 12.61 

First, let us show that all the intermediary work and propositions leading up to Theorem 
11.11 save Proposition 12.21 are easily deduced from Theorem 11.11 itself, li J = [a,b] C A + B 
is the arithmetic progression with difference 1 given by Theorem ll.lt then observe that 

{A U [a, 6 - A^]) + {BU[a,b-M]) = A + B. 

From this observation, the apparently stronger bound given by Proposition 12.61 is now easily 
derived from Theorem 11.11 Additionally, if J does not contain the interval \N, M\ , then it 
follows that 

|J| < M = |A| - 1 < 1^1 + 1^1 - 3-5(^,5), 

contrary to Theorem II. 1[ Thus Proposition 12.11 is a consequence of Theorem 11.11 Noting 
that the hypothesis Ha < \B\ — 2 — 5{A,B) implies (in view of ([9]) and dH)) that hs < 
|^|— 2— B), it is then easily derived from the existence of the long arithmetic progression 
J = [a,b] A + B , and a simple calculation, that all stable holes preceding a are left stable 
and that all stable holes following b — N or b — M, respectively for ^ or i3, are right stable. 
So the propositions concerning right and left stable holes also follow from Theorem 11.11 
This leaves only Proposition 12.21 as containing information additional to Theorem 11.11 as 
claimed. 

Next, it is important to note that Theorem 1 1 . 1 |/ Proposition 12.61 essentially shows that the 
sets A and B can be divided into left and right halves with each half behaving independently 
(with respect to the sumset A + B) of the other. Taking the left halves Al = An [0, e] 
and Bl = B D [0, e] and unioning each with a sufficiently long interval [e + l,x], where 
X > e + 1 + r, results in a pair of subsets whose sumset exhibits the same structural 
behavior on the left side as for the original sumset A + B. (The right side A + B can be 
independently studied in a similar manner.) 

In general, there are many possibilities for how the holes can be distributed m Al and 
Bl. However, if one wishes to use holes efficiently, that is, use a large number of holes 
relative to the maximal bound r, then (jl9p and (|20p show that the number of unstable 
holes must be small, which helps restrict the possibilities for Aj^ and Bj^. 

For instance, in the extremal case when there are no unstable holes in either A oi B, 
then we must have Al = Bl, and Al U [e + 1, oo) is the complement of the solution set of 
the Frobenius problem (see [10]) for the set A (i.e, U [e + 1, oo) = U/lii ^A, where hA = 
A + . . . + A denotes the /i-fold sumset). In particular, if di, d2 G Al, then the arithmetic 

h 

progression {di +id2 | i = 0, 1, 2, . . . , } is contained in U [e + 1, oo). In fact, Al 'is just the 
intersection of the multi-dimensional progression {iidi + i2'^2 + ■ ■ ■ + iidi \ ij = 0, 1, 2, . . .} 
with [0, e], where Al = {0, di,d2, ■ ■ ■ , di}. 
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